Rank in Wordlist | Frequency | Word |
---|---|---|
21476 | 6 | 2,2 |
30870 | 3 | 2,500 |
31109 | 3 | 7,500 |
37864 | 2 | 1,144 |
37865 | 2 | 1,400 |
37866 | 2 | 1,500 |
38083 | 2 | 2,600 |
38369 | 2 | 3,021 |
38403 | 2 | 33,000 |
38413 | 2 | 352,000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
2503 | 251 | لاہور(اُردوپوائنٹ |
2661 | 229 | آباد(اُردوپوائنٹ |
2929 | 200 | آباد(اُردو |
3349 | 167 | کراچی(اُردوپوائنٹ |
4478 | 106 | لاہور(اُردو |
4564 | 102 | لیگ(ن |
5828 | 69 | کراچی(اُردو |
5931 | 67 | پشاور(اُردوپوائنٹ |
9371 | 30 | لیگ(ق |
9897 | 27 | لندن(اُردوپوائنٹ |
Rank in Wordlist | Frequency | Word |
---|---|---|
2318 | 280 | 2012ء)پاکستان |
4575 | 101 | 2012ء)بالی |
4894 | 91 | 2012ء)کراچی |
6177 | 62 | 2012ء)وزیر |
6723 | 54 | 2012ء)سپریم |
7038 | 50 | 2012ء)پنجاب |
7201 | 48 | 2012ء)بھارت |
7202 | 48 | 2012ء)لاہور |
7649 | 43 | 2012ء)بھارتی |
7857 | 41 | 2012ء)قومی |
Rank in Wordlist | Frequency | Word |
---|---|---|
53316 | 1 | 1.5% |
53331 | 1 | 10%کے |
53499 | 1 | 11.85%ہوچکا |
55737 | 1 | 36.67%ووٹ |
56217 | 1 | 50%بڑھیگی۔ |
56288 | 1 | 53%جگہیں |
56289 | 1 | 53.35%جبکہ |
57211 | 1 | 97%کام |
112058 | 1 | ہ%زار |
Rank in Wordlist | Frequency | Word |
---|---|---|
57308 | 1 | AT&T |
59264 | 1 | S&P |
Rank in Wordlist | Frequency | Word |
---|---|---|
16407 | 11 | ہیں'۔ |
21386 | 7 | ہے'۔ |
26729 | 4 | Moody's |
31308 | 3 | d'Or |
32846 | 3 | تھا'۔ |
37415 | 3 | ہوں'۔ |
38713 | 2 | America's |
39041 | 2 | Poor's |
39084 | 2 | Sotheby's |
51817 | 2 | گی'۔ |
Rank in Wordlist | Frequency | Word |
---|---|---|
15700 | 11 | آباد/ |
21528 | 6 | 9/11 |
23106 | 6 | پشاور/ |
23697 | 5 | 26/11 |
26740 | 4 | SIU/CIA |
28038 | 4 | دہلی/ |
31414 | 3 | آباد/لاہور/ |
34684 | 3 | لاہور/اسلام |
34685 | 3 | لاہور/فیصل |
37900 | 2 | 11/9 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots